Optimizing Text Categorization for Indonesian Text Using Clustering Label Technique
نویسندگان
چکیده
منابع مشابه
Automatic Text Categorization Using Trend-Tracking Technique
In this paper, we propose a novel text categorization method using trendtracking technique. The method classies texts by tracking the transition of information in them. Therefore, it can deal especially well with texts whose content transits gradually with the passage of time, such as Internet news articles, newspaper articles, or web pages which are often updated. Experimental results show tha...
متن کاملImproving Methods for Single-label Text Categorization
As the volume of information in digital form increases, the use of Text Categorization techniques aimed at finding relevant information becomes more necessary. To improve the quality of the classification, I propose the combination of different classification methods. The results show that k-NN-LSI, the combination of k-NNwith LSI, presents an average Accuracy on the five datasets that is highe...
متن کاملSelection Strategies for Multi-label Text Categorization
In multi-label text categorization, determining the final set of classes that will label a given document is not trivial. It implies first to determine whether a class is suitable of being attached to the text and, secondly, the number of them that we have to consider. Different strategies for determining the size of the final set of assigned labels are studied here. We analyze several classifi...
متن کاملTwo-dimensional Clustering for Text Categorization
We propose a new method to improve the accuracy of Text Categorization using twodimensional clustering. In a number of previous probabilistic approaches, texts in the same category are implicitly assumed to be generated from an identical distribution. We empirically show that this assumption is not accurate, and propose a new framework based on twodimensional clustering to alleviate this proble...
متن کاملAutomatic Word Clustering for Text Categorization Using Global Information
This paper presents a cluster-based text categorization system which uses class distributional clustering of words. We propose a new clustering model which considers the global information over all the clusters. The model can group words into clusters based on the distribution of class labels associated with each word. Using these learned clusters as features, we develop a cluster-based classif...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Turkish Journal of Computer and Mathematics Education (TURCOMAT)
سال: 2021
ISSN: 1309-4653
DOI: 10.17762/turcomat.v12i3.947